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ABSTRACT 

We have studied topology of the distribution of the high redshift galaxies identified 
in the Hubble Deep Field (HDF) North and South. The two-dimensional genus is 
measured from the projected distributions of the HDF galaxies at angular scales from 
3.8" to 6.1". We have also divided the samples into three redshift slices with roughly 
equal number of galaxies using photometric redshifts to see possible evolutionary 
effects on the topology. 

The genus curve of the HDF North clearly indicates clustering of galaxies over the 
Poisson distribution while the clustering is somewhat weaker in the HDF South. This 
clustering is mainly due to the nearer galaxies in the samples. We have also found that 
the genus curve of galaxies in the HDF is consistent with the Gaussian random phase 
distribution with no significant redshift dependence. 

Subject headings: cosmology: observation — galaxies: large-scale structure, topology 

1. INTRODUCTION 

An important prediction of typical inflationary models is that the matter fluctuation field 
has a Gaussian random phase distribution. At large linear scales where the galaxy distribution 
is presumely still in the linear regime and therefore keeping the statistics of the primordial 
fluctuation field, studies of the topology of the galaxy distribution can test such predictions. At 
smaller non-linear scales where the galaxy distribution depends sensitively on the small-scale 
physics, a study of topology can give us information on the mechanism of galaxy formation and 
evolution as well as on cosmology. 

During the past 15 years topology of large scale structures has been measured by many 
authors using various observational samples following after the initial work of Gott, Melott & 
Dickinson (1986), Hamilton, Gott & Weinberg (1986), and Gott, Weinberg & Melott (1987). 
Among these, topology measures for two-dimensional fields have been introduced by Coles & 
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Barrow (1987), Melott et al. (1989) and Gott et al. (1990), and applied to observational samples 
like the angular distribution of galaxies on the sky (Coles & Plionis 1991; Gott et al. 1992), the 
distribution of galaxies in slices of the universe (Park et al. 1992; Colley 1997) and the temperature 
fluctuation field of the Cosmic Microwave Background (Coles 1988; Gott et al. 1990; Smoot et 
al. 1994; Kogut et al. 1996; Colley, Gott & Park 1996; Park et al. 1998). In the first two cases 
where the samples dominantly include nearby galaxies, the topology of two-dimensional galaxy 
distributions as revealed by the genus or the Euler-Poincare characteristic statistics, is consistent 
with the random phase Gaussian distribution with a possible weak 'meat-ball' topology. 

The HDF images (Williams et al. 1996) taken by the Wide Field Planetary Camera 2 
(WFPC-2) on the Hubble Space Telescope, have given us an unprecedently deep view of the high 
redshift universe. One can easily detect objects with AB magnitudes (Oke 1974) down to 28 ~ 29 
(Madau et al. 1996) and with an angular resolution of about 0.05". Important issues one can 
address from the HDF data include the discovery of proto-galaxies forming at high redshifts, or 
delineating the epoch of galaxy formation (Steidel et al. 1996; Clements & Couch 1996; Park & 
Kim 1998). Another important problem one can study from the HDF data is the properties of 
galaxies at high redshifts, or the evolution of galaxies. This can be done by looking at numbers, 
colors, and clustering of galaxies as a function of redshift. 

In this paper we study the topology of the distribution of galaxies identified in the HDF North 
and South and having either spectroscopic or photometric redshifts (Fernandez-Soto, Lanzetta, & 
Yahil 1999, hereafter FLY99; Lanzetta et al. 1999). From these data sets we hope to study the 
topology of the galaxy distribution at high redshifts. 



We use the two-dimensional genus statistic introduced by Melott et al. (1989) as a quantitative 
measure of topology of the galaxy distribution in the HDFs. The study most similar to our work is 
that of Gott et al. (1992) who have measured the genus of angular distributions of nearby galaxies 
in the UGC and ESO catalogs. Coles k, Plionis (1991) have also measured the Euler-Poincare 
characteristic, which is equivalent to the genus in the limit of negligible sample boundary effects, 
for the Lick catalog. 

The two-dimensional genus is defined as (Gott et al. 1992) 

G = (number of isolated high density regions) — (number of isolated low density regions). (1) 

When a two-dimensional distribution is given, the Gauss-Bonnett theorem relates the genus of 
isodensity contours with the line integral of the local curvature k (Gott et al. 1990) 



2. Topology Measure 



2.1. 



The Genus 




(2) 
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In the case of a Gaussian random field the genus per unit area is known to be (Melott et al. 1989; 
Coles 1988) 



1 (fc^) 



where v is the threshold density for the isodensity contour in units of standard deviations from 
the mean, (A;^) = / k'^P2{k)(Pkl J P2{k)dFk and P2{k) is the smoothed two-dimensional power 
spectrum. In practice, we are not interested in the one-point distribution of the density field, and 
therefore use the label ua which parametrizes the area faction by 



We calculate the genus from i^^ = — 3 to 3 with an interval of 0.2. 

When the power spectrum of the three-dimensional galaxy distribution is of the power-law 
form P^ik) oc and when the thickness of the two-dimensional slice is much larger or smaller 
than the smoothing length of the Gaussian filter W{r) oc e~''^/^^G, there exist simple analytic 
formulae for the amplitude (Melott et al. 1989) 

(k^) = ^, (5) 

where F{n) = {n + 2)/2 for n > — 2 and for — 3 < n < —2 for slices with thickness much larger 
than the smoothing length Rq, and F{n) = 1 for n > — 1 and (n -|- 3)/2 for — 3 < n < — 1 for thin 
slices. The thick slice approximation is relevant for maps of galaxies projected on the sky. 



2.2. The Genus-Related statistics 

Since the genus-threshold density relation for Gaussian fields is known, non-Gaussian 
behaviour of a field can be detected from deviations from the relation. We quantify such deviations 
by genus-related statistics. The first is the shift of the genus curve A.u (Park et al. 1992). We 
measure the shift parameter from the observed genus curve by minimizing the between the 
data and the fitting function 

G = Az/'e-^"/2, (6) 

where v' = v — Az/ and the amplitude A of the genus curve is allowed to have different values at 
negative and positive v' . The ^^-minimization is performed over the range —1.0 <v< 1.0. 

The second statistic is the asymmetry parameter which measures the difference in the 
amplitude of the genus curve in the positive and negative thresholds (i.e. the difference between 
the numbers of clumps and voids). The asymmetry parameter is defined as 



^g = Ac- Ay, 



(7) 
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where 

Ac= gobsdv/ / g^Adu, (8) 

and likewise for Ay. The integration is limited to —2 < v < —0.4 for Ay and to 0.4 < v < 2 
for Ac- The overall amplitude ^ of the best-fit genus curve ^fit is found from the x^-fitting 
over the range —2.0 < v < 2.0. Ag" is positive when high-density regions are divided into many 
clumps while the low-dcnsity regions are merged into fewer voids. For an observed genus curve, we 
therefore measure the best-fit amplitude A, the shift parameter Ai/, and the asymmetry parameter 
A5- 



3. Analysis of the HDF Data 

3.1. The HDF data 

We use the photometric redshift data of galaxies in the HDF North (hereafter HDFN) 
published by FLY99. The catalog has 1067 galaxies with photometric and/or spectroscopic 
redshifts. We use the spectroscopic redshifts whenever available. We limit the sample to z < 2, 
which leaves us with 820 galaxies because the redshift space distribution of galaxies sharply drops 
at z ~ 2 and because the photometric redshift starts to have relatively large error at z > 2 
(FLY99). We then drop the Planetary Camera image as well as the edges of the Wide Field 
Camera images where the magnitude limit to the sample is bright, i.e. ^5(8140) = 26.0. 

In Figure 1 the long-dashed lines delineate the inner part of the WFPC2 images which encloses 
714 galaxies with AB(8140) < 28.0 (hereafter the S-zone; see dash lines in Fig. 1 of FLY99). The 
genus is actually measured in the region 100 pixels inside these boundaries (hereafter the G-zone; 
thick solid lines in Figure 1) after the galaxy distribution is smoothed in the S-zone. There are 605 
galaxies with z <2 and AS (8140) < 28.0 in the G-zone. To see the possible evolution effects we 
divide the HDFN sample into three overlapping redshift slice subsamples, each of which contains 
about 300 galaxies in the G-zone. Table 1 lists the definitions of these subsamples together with 
the total sample. The middle slice overlaps with the first and the third ones. 

The photometric redshift data in the HDF South (HDFS) field has been obtained from a 
web page^ (Lanzetta et al. 1999; Yahata et al. 2000). This catalog is close to complete down 
to AB{SIAQ) < 28.0 (Fernandez-Soto 2000). The original catalog contains 1275 redshifts. The 
magnitude limit of AB{814:0) = 28.0 and redshift limit of z = 3 leave us with 727 galaxies, and 
614(530) of these galaxies are within the S(G)-zones, respectively. This whole HDFS sample is 
further divided into three redshift space slices with roughly 265 galaxies each as described in 
Table 1. Figure 2 shows the HDFS galaxies satisfying the magnitude and redshift limits, and the 
boundaries defining the S- and G-zones. 



^ http : / /www. ess . sunysb .edu/astro/hdfs 
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3.2. Results 

To measure the genus the discrete distribution of galaxies must be smoothed by an appropriate 
filter. We first maJie a mask which has the value 1 within the S-zone and outside. We ignore the 
small regions contaminated by stars and relatively big galaxies in the HDFs because taking into 
account those regions to the mask turns out to have negligible effects on the genus results because 
our smoothing lengths are large compared to them. Wc first smooth the mask over a smoothing 
length using the Gaussian filter. At the same time the distribution of the HDF galaxies in the 
S-zone are smoothed over the same length and divided by the smoothed mask to yield the smooth 
galaxy density field (Melott et al. 1989). Then the genus is measured from the smoothed density 
array only within the G-zone. We use the C0NT0UR2D code (Weinberg 1988) to measure the 
genus. The code has been modified so that the genus at each threshold level is an average over 
three genus values with the threshold levels v shifted by and ±0.03. 

We have chosen the Gaussian smoothing radius Rq as the half-width at half maximum of 

the Gaussian function, and set Rq = dj \/ 21n 2 = 0.849(J. This is slightly smaller than the mean 
separation d, but larger than the 'e-folding smoothing length' Ag = dj^pl = 0.707J used in some 
studies. 

Figure 3 and 4 show the genus curves (filled dots) for our HDFN and HDFS samples and 
their best fitting Gaussian genus curves (solid curves), respectively. Error bars are estimated 
from 20 bootstrap resamplings of the galaxies. Small open circles are the genus curves averaged 
over 100 realizations of Poisson distributions with the same number of galaxies. In Table 2 we 
summarize the genus-related statistics A, Az/, and measured from each HDF subsample. The 
68% uncertainty limits arc again estimated from the genus-related statistics measured from 20 
bootstrap resamplings of galaxies. The mean amplitude of the genus curves from 100 Poisson 
realizations of the distribution of the mock HDF galaxies in each sample is also included. 

4. Discussion 

The genus curves of the HDF samples shown in Fig. 3 and 4 and their statistics listed in 
Table 2 indicate that the distribution of the HDF galaxies is consistent with a Gaussian random 
phase distribution because most subsamples have shift and asymmetry parameters consistent with 
zero. The only statistically significant behaviour of the genus curves is their lower amplitudes 
compared to the corresponding Poisson distributions. In the north samples this coherence of the 
galaxy distribution is mainly caused by the shallow slice with < z < 1.1. The fact that the 
genus curve has an amplitude significantly lower than the Poisson one indicates that the smoothed 
distribution of the HDF galaxies does have a real signal. 

Therefore, even though the our HDF samples are radial projections of galaxies in very 
long thin rods, the galaxy distribution in each sample is not merely a projection of statistically 
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independent galaxies, but maintains a finite clustering signal. To check if this should be the 

case, we need to know the angular covariance function (CF) of galaxies at high redshifts. Gott 
and Turner (1979) have found that the angular CF of galaxies at the present epoch continues 
inward without any break with a slope of 0.8 (i.e. w{6) oc 9^^'^) down to the smallest scale 
measured, a comoving scale of 0.0033 Mpc. For comparison at the median redshift z = 1.1 
of the HDFN (for an Einstein-de Sitter cosmology) the smoothing lengths Rq = 3.8" and 5.4" 
correspond to comoving scales of 0.034/i~^ Mpc and 0.049/i~^ Mpc, respectively (or over an order 
of magnitude larger). Assuming that the CF is constant in the comoving space (The CF does not 
grow greatly from z = 1.1 to the present in the flat lambda model with Q„i = 1/3 and I^a = 2/3 
popular today, for example.), from Gott and Turner's present epoch CF from the Zwicky catalog 
w{e) K 17.3(6l/l')"°-^ we estimate 

w{e) = O.494(0/1.7")-°-S = (0/O.7O")-°•^ (9) 

as the depth of the sample changes from = 53h~^ Mpc for the Zwicky catalog to = 1860^"^ 
Mpc for the HDFN sample. Then the average fractional excess number counts of galaxies within 
a smoothing area around a galaxy is approximately 

(^> = w5i ^(*)2.« = -(iic/0.70»)-«». (10) 

Dince Eg = 3.8" for the HDFN, (^) = 0.43. For comparison, a Poisson distribution would have a 
RMS fluctuation of (^) = = {Rq / d)-'^ / ^/tt = 0.66 where we used N = ttR%/^ given the 

mean galaxy separation li. Therefore, we expect the signal-to-noisc ratio in the HDFN sample to 
be about 0.65, so there still remains some detectable physical clustering of galaxies in the field. A 
lower limit on the signal-to-noise ratio could be established by assuming that clusters present at 
z = 1.1 remained at the same physical size to the present, representing a growth of the amplitude 
CF in comoving coordinates proportional to where a = 1/(1 + z). That would give a 

S/N = 0.27. 

We may also estimate the signal-to-noise ratio in the smoothed maps directly from the 
topology statistics. The amplitude of the genus curve is proportional to F{n) from equation (5). 
The noise contribution from a Poisson distribution of galaxies corresponds to a power-law index 
n = and F{n) = 1. If the signal is characterized by an angular CF with w{6) oc 6^^'^. This 
corresponds to a power-law index n = —1.2 and F{n) = 0.4. Thus if we were observing pure signal 
we would expect A = 0.4ylpoisson- If we were observing pure noise we would expect A = Apoisson- 
We actually observe for the HDFN a value of A = 0.7774poisson suggesting, by linear interpolation, 
a value of S/N = 0.62. Repeating this calculation for the HDFS where we observe A = 0.86Apoisson 
gives a S/N = 0.30, both being consistent with the back-of-the-envelope calculations given in the 
previous paragraph. 

We have also found that the shift parameter in Table 2 is consistent with zero shift. 
Actually the HDFN-1 and HDFS samples show significant bubble {Au > 0) and meat-ball 
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shifts, respectively. But the shift averaged over different subsamples is consistent with zero. 

The asymmetry parameter is also consistent with zero. Even though the HDFN-3 sample has a 
significant excess of the genus curve amplitude in the positive thresholds (more clusters than voids 
at the same volume fraction), the mean asymmetry parameter for all subsamples is still within 
one standard deviation from zero. 

This work was supported by the KOSEF grant (1999-2-113-001-5) and the NSF grant 
AST-9900772. CBP would like to thank Canadian Institute for Theoretical Astrophysics for the 
hospitality during this work. The authors thank Dr. Michael Vogeley for helpful comments and 
Dr. Alberto Fernandez- Soto for sending us unpublished results on the HDFS photometric redshift 
data. 
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Fig. 1.— The 820 galaxies (dotts) with < z < 2.0 in the HDF North. The dashed hnes mark 
the region where the galaxy distribution is smoothed, and the solid lines show the borders of the 
region where the genus is measured. Contour lines represent the ua = +2 (thick solid line), +1 
(solid), (thin solid), —1 (light dotted), and —2 (heavy dotted) iso-density contours of the galaxy 
number density smoothed over 3.8" or 95 pixels. Coordinates are the pixel numbers in the HDF 
mosaic image. 
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Fig. 2. — The 727 galaxies (dotts) with < z < 3.0 in the HDF South. The mosaic image 
is rotated by 4.76° clockwise. Superposed are the iso-density contours of the galaxy distribution 
smoothed over 4.4" or 110 pixels. 
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HDFN 




Fig. 3. — The genus curves (filled dotts) of the galaxy distributions of the HDF North subsamples. 
The solid lines are the Gaussian genus curves best fit to the observed data. The open circles are 
the average genus curve of 100 Poisson distributions with the same number of points as the number 
of galaxies in each HDF subsample. 




Fig. 4. — Same as Fig. 3 but for the HDF South subsamples. 
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Table 1. The HDF subsamples 



Samples 


z 


^med 


iVg(S/G-zone) 


Rg 


HDFN 


0.00 < z< 2.00 


1.10 


714/605 


3.8" 


HDFN-1 


0.00 <z< 1.10 


0.65 


368/307 


5.4" 


HDFN-2 


0.65 <z< 1.60 


1.10 


365/305 


5.4" 


HDFN-3 


1.10 <z< 2.00 


1.60 


346/298 


5.4" 


HDFS 


0.00 <z< 3.00 


1.16 


614/530 


4.4" 


HDFS-1 


0.00 < z< 1.16 


0.56 


300/204 


6.1" 


HDFS-2 


0.65 <z< 2.04 


1.16 


310/364 


6.1" 


HDFS-3 


1.16 <z< 3.00 


2.04 


314/266 


6.1" 
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Table 2. Genus-related statistics for the HDF subsamples 



Samples A ^Poisson Az^ - Afpoisson Ay - A^Poisson 



HDFN 20.3 ±2.3 26.5 -0.01 ± 0.08 -0.22 ±0.17 

HDFN-1 9.1 ±1.5 13.2 ±0.30 ± 0.10 ±0.21 ± 0.21 

HDFN-2 11.4 ±1.5 13.2 -0.01 ±0.11 -0.14 ±0.19 

HDFN-3 11.5 ±2.7 13.2 ±0.03 ± 0.07 ±0.35 ±0.13 

HDFS 19.2 ± 1.9 22.4 -0.15 ±0.07 -0.12 ±0.14 

HDFS-1 10.6 ±0.9 11.7 -0.17 ±0.10 -0.31 ± 0.16 

HDFS-2 7.2 ±1.2 11.7 -0.25 ± 0.15 -0.24 ± 0.28 

HDFS-3 12.0 ±1.1 11.7 ±0.03 ±0.11 -0.19 ±0.16 



